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Abstract We study the Monadic Second Order (MSO) Hierarchy over 
infinite pictures, that is tilings. We give a characterization of existential 
MSO in terms of tilings and projections of tilings. Conversely, we charac- 
terise logic fragments corresponding to various classes of infinite pictures 
(subshifts of finite type, sofic subshifts). 



1 Introduction 

There is a close connection between words and monadic second-order (MSO) 
logic. Biichi and Elgot proved for finite words that MSO-formulas correspond 
exactly to regular languages. This relationship was developed for other classes 
of labeled graphs; trees or infinite words enjoy a similar connection. See [20,13] for 
a survey of existing results. Colorings of the entire plane, i.e tilings, represent 
a natural generalization of biinfinite words to higher dimensions, and as such 
enjoy similar properties. We plan to study in this paper tilings for the point of 
view of monadic second-order logic. 

Tilings and logic have a shared history. The introduction of tilings can be 
traced back to Hao Wang [21], who introduced his celebrated tiles to study the 
(un)decidability of the V3V fragment of first order logic. The undecidability of the 
domino problem by his PhD Student Berger [3] lead then to the undecidability 
of this fragment [5]. Seese [10,18] used the domino problem to prove that graphs 
with a decidable MSO theory have a bounded tree width. Makowsky[12,15] used 
the construction by Robinson [16] to give the first example of a finitely axiom- 
atizable super-stable theory that is super-stable. More recently, Oger [14] gave 
generalizations of classical results on tilings to locally finite relational structures. 
See the survey [2] for more details. 

Previously, a finite variant of tilings, called tiling pictures, was studied [6,7]. 
Tiling pictures correspond to colorings of a finite region of the plane, this region 
being bordered by special '#' symbols. It is proven for this particular model that 
language recognized by EMSO-formulas correspond exactly to so-called finite 
tiling systems, i.e. projections of finite tilings. 
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The equivalent of finite tiling systems for infinite pictures axe so-called sofic 
subshifts [22] . A sofic subshift represents intuitively local properties and ensures 
that every point of the plane behaves in the same way. As a consequence, there 
is no general way to enforce that some specific color, say q appears at least once. 
Hence, some simple first-order existential formulas have no equivalent as sofic 
subshift (and even subshift). This is where the border of # for finite pictures 
play an important role: Without such a border, results on finite pictures would 
also stumble on this issue. 

We deal primarily in this article with subshifts. See [1] for other acceptance 
conditions (what we called subshifts of finite type correspond to A-acceptance 
in this paper). 

Finally, note that all decision problems in our context are non-trivial : To 
decide if a universal first-order formula is satisfiable (the domino problem, pre- 
sented earlier) is not recursive. Worse, it is i7|-hard to decide if a tiling of the 
plane exists where some given color appears infinitely often [9,1]. As a conse- 
quence, the satisfiability of MSO-formulas is at least Z'J-hard. 

2 Symbolic Spaces and Logic 

2.1 Configurations 

Let d > 1 be a fixed integer and consider the discrete lattice Z''. For any finite 
set Q, a Q-configuration is a function from Z"^ to Q. Q may be seen as a set 
of colors or states. An element of Z"^ will be called a cell. A configuration will 
usually be denoted C, M or N. 

Fig. 1 shows an example of two difi'erent configurations of over a set Q of 5 
colors. As a configuration is infinite, only a finite fragment of the configurations is 
represented in the figure. The reader has to use his imagination to decide what 
colors do appear in the rest of the configuration. We choose not to represent 
which cell of the picture is the origin (0, 0) (we use only translation invariant 
properties). 

A pattern is a partial configuration. A pattern P : X ^ Q where X C Z^ 
occurs in C eQ^ at position zo if 

yzeX, C{zo + z)=P{z). 

We say that P occurs in C if it occurs at some position in C. As an example 
the pattern P of Fig 2 occurs in the configuration M but not in N (or more 
accurately not on the finite fragment of N depicted in the figure). A finite pattern 
is a partial configuration of finite domain. All patterns in the following will be 
finite. The language £{C) of a configuration C is the set of finite patterns that 
occur in C. We naturally extend this notion to sets of configurations. 

A subshift is a natural concept that captures both the notion of uniformity 
and locality: the only description "available" from a configuration C is the finite 
patterns it contains, that is C{C). Given a set of patterns, let Xj^ be the set 
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Figure 1. Two configurations 

Figure 2. A pattern P. P appears in M but presumably not in A'^ 

of all configurations where no patterns of J- occurs. 

X:F^{C\C{C)C^T = %} 

T is usually called the set of forbidden patterns or the forbidden language. A set 
of the form Xyr is called a subshift. 

A subshift can be equivalentely defined by topology considerations. Endow 
the set of configurations with the product topology: A sequence (C„)„gN of 
configurations converges to a configuration C if the sequence ultimately agree 
with C on every z ^ 7?. Then a subshift is a closed subset of also closed by 
shift maps. 

A subshift of finite type (or tiling) correspond to a finite set J-: it is the set 
of configurations C such that no pattern in J- occurs in C. If all patterns of J- 
are of diameter n, this means that we only have to see a configuration through 
a window of size n to know if it is a tiling, hence the locality. 

Given two state sets Qi and Q2, a projection is a map tt : Qi — > (52- We 
naturally extend it to tt : Qf — > Qf by 7r(C)(z) = tt{C(z)). A sofic subshift of 
state set Q2 is the image by some projection tt of some subshift of finite type of 
state set Qi. It is also a subshift (clearly closed by shift maps, and topologically 
closed because projections are continuous maps on a compact space). A sofic 
subshift is a natural object in tiling theory, although quite never mentioned 
explicitly. It represents the concept of decoration: some of the tiles we assemble 
to obtain the tilings may be decorated, but we forgot the decoration when we 
observe the tiling. 



2.2 Structures 



From now on, we restrict to dimension 2. A configuration will be seen in this 
article as an infinite structure. The signature t contains four unary maps North, 
South, East, West and a predicate Pc for each color c G Q. 

A configuration M will be seen as a structure DJl in the following way: 

— The elements of 9Jl are the points of Z^. 

— North is interpreted by North^((a;, y)) — {x,y + 1), East is interpreted by 
East™{{x,y)) = {x + 1, y). South™ and West™ arc interpreted similarly 

— P™((x, y)) is true if and only if the point at coordinate {x,y) is of color c, 
that is if M{x,y) — c. 

As an example, the configuration M of Fig. 1 has three consecutive cells with 
the color q. That is, the following formula is true: 



As another example, the following formula states that the configuration has 
a vertical period of 2 (the color in the cell {x,y) is the same as the color in the 
cell {x, y + 2)). The formula is false in the structure 9Jl and true in the structure 
VI (if the reader chose to color the cells of N not shown in the picture correctly): 



2.3 Monadic Second-Order Logic 

This paper studies connection between subshifts (seen as structures as explained 
above) and monadic second order sentences. First order variables (x, y, z, ...) 
are interpreted as points of 7? and (monadic) second order variables {X, Y, Z, 
...) as subsets of Z^. 

Monadic second order formulas are defined as follows: 

— a term is either a first-order variable or a function (South, North, East, West) 
applied to a term ; 

— atomic formulas are of the form ti = t2 or X{ti) where ti and ^2 are terms 
and X is either a second order variable or a color predicate ; 

— formulas are build up from atomic formulas by means of boolean connectives 
and quantifiers 3 and V (which can be applied either to first-order variables 
or second order variables). 

A formula is closed if no variable occurs free in it. A formula is FO if no 
second-order quantifier occurs in it. A formula is EMSO if it is of the form 
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3Xi,...,3X„,0(X) 



where is FO. Given a formula (piXi, . . . ,X„) with no free first-order variable 
and having only Xi, . . . ,X„ as free second-order variables, a configuration M 
together with subsets ...,£"„ is a model of (j){Xi, . . . , Xn), denoted 



if (f> is satisfied (in the usual sense) when M is interpreted as 971 (see previous 
section) and Ei interprets Xi. 

2.4 Definability 

This paper studies the following problems: Given a formula </> of some logic, what 
can be said of the configurations that satisfy </)? Conversely, given a subshift, 
what kind of formula can characterise it? 

Definition 2.1 A set S of Q- configurations is defined by (j) if 



Two formulas 4> and (j)' are equivalent iff they define the same set of config- 
urations. 

A set S is C-definable if it is defined by a formula (j) ^ C. 

Note that a definable set is always closed by shift (a shift between 2 configura- 
tions induces an isomorphism between corresponding structures). It is not always 
closed: The set of {q, Qj-configurations defined by the formula (j) ■ 3z, -Pq(-z) con- 
tains all configurations except the all-white one, hence is not closed. 

When we are dealing with MSO formulas, the following remark is useful: 
second-order quantifiers may be represented as projection operations on sets of 
configurations. We formalize now this notion. 

If TT : Qi 1-^ is a projection and is a set of Qi-configurations, we define 
the two following operators: 



Note that ^ is a dual of E, that is A{Tr){S) — ''E{tt){'^S) where represents 
complementation. 

Proposition 2.2 

— A set S of Q- configurations is EMS 0- definable if and only if there exists a 
set S' of Q' configurations and a map n : Q' t-^ Q such that S — E{t:){S') 
and S' is FO-definable. 

— The class of MSO-definable sets is the closure of the class of FO-definable 
sets by the operators E and A. 
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Proof (Sketch). We prove here only the first item. 



Let (j) = 3X, ^ be a EMSO formula that defines a set S of Q-configurations. 
Let Q' = Q X {0, 1} and tt be the canonical projection from Q' to Q. 
Consider the formula ip' obtained from ^ by replacing X{t) by VceQ-P(c,i)(t) 
andPe(t) byP(,,o)W VP(c,i)(t). 

Let S' be a set of Q' configurations defined by ip'. Then is it clear that 
S ~ E{Tr){S'). The generalization to more than one existential quantifier is 
straightforward. 

Let S = E{'7t){S') be a set of Q configurations, and S' FO-definable by the 

formula (p. Denote by ci . . . c„ the elements of Q' . Consider the formula (p' 
obtained from </> where each Pc- is replaced by Xi. Let 



i, = 3X,,...,3Xn,{ 



fVz,V,X,(z) 
yz,Ai^j{^Xi{z)V^Xj{z)) 



Then ip defines S. Note that the formula ijj constructed above is of the form 
3Xi, BXniyz, -ip'iz)) A (j)'. This will be important later. □ 

Second-order quantifications will then be regarded in this paper either as 
projections operators or sets quantifiers. 



3 Hanf Locality Lemma and EMSO 

The first-order logic has a property that makes it suitable to deal with tilings 
and configurations: it is local. This is illustrated by Hanf's lemma [8,4,11]. 

Definition 3.1 Two Q- configurations M and N are {n, k)-equivalent if for each 
Q -pattern P of size n: 

— If P appears in M less than k times, then P appears the exact same number 
of times in M and in N 

— If P appears in M more than k times, then P appears in N more than k 

times 

This notion is indeed an equivalence relation. Given n and k, it is clear that 
there is only finitely many equivalence classes for this relation. 

The Hanf's local lemma can be formulated in our context as follows: 

Theorem 3.2 For every FO formula (j), there exists (n, k) such that 

if M and N are (n, k) equivalent, then 071 1= ^ <^=^ (j) 

Corollciry 3.3 Every FO-definable set is a (finite) union of some {n, k)- equivalence 
classes. 



This is theorem 3.3 in [7], stated for finite configurations. Lemma 3.5 in the same 
paper gives a proof of Hanf's Local Lemma in our context. 

Given (P, k) we consider the set S=k (-P) of all configurations such that the 
pattern P occurs exactly k times {k may be taken equal to 0). The set S>k{P) is 
the set of all configurations such that the pattern P occurs more than k times. 

We may rephrase the preceding corollary as: 

Corollary 3.4 Every FO-definahle set is a positive combination (i.e. unions 
and intersections) of some S=k{P) and some S>k{P) 

Theorem 3.5 Every EMSO-definable set can be defined by a formula (j) of the 
form: 



where and 02 are quantifier-free formulas. 

See [20, Corollary 4.1] or [19, Corollary 4.2] for a similar result. This result is an 
easy consequence of [17, Theorem 3.2] (see also the corrigendum). We include 
here a full proof. 

Proof. Let C be the set of such formulas. We proceed in three steps: 

— Every EMSO-definable set is the projection of a positive combination of 
some S=k{P) and S>k{P) (using prop. 2.2 and the preceding corollary) 

— Every S={P,k) (resp. S>{P,k)) is C-definable 

— C-definable sets are closed by (finite) union, intersection and projections. 

C-definable sets are closed by projection using the equivalence of prop. 2.2 in 
the two directions, the note at the end of the proof and some easy formula 
equivalences. The same goes for intersection. 

Now we prove that C-definable sets are closed by union. The difficulty is to 
ensure that we use only one universal quantifier. Let (f> and (j)' be two C-formulas 
defining sets Si and 52. We can suppose that (p and </>' use the same numbers of 
second-order quantifiers and of first-order existential quantifiers. 

Then the formula 



3Xi 



, . . . , 3Xn, {\fZi, 0i(zi, Xi, . . . ,Xn)) 

A {3zi, 3Zp, (j)2{zi...Zp,Xi,..., Xn)) , 



3X,3Xi,...,3X„,V^i, 




X{zi) X(North(zi)) 
Xizi) ^ X(East(zi)) 
X{zi) =^ 0i(zi,Xi...X„) 

^x{zi) =^ 0;(zi,xi...x„) 




w X{zi) A(j)2{Zi...Zp,Xi...Xn) 
V ^X(;2i)A0^(zi...Zp,Xi...X„) 



defines U S2 (the disjunction is obtained through variable X which is forced 
to represent either the empty set or the whole plane Z^). 



It is now sufficient to prove that a S'=fc(P) set (resp. a S>k{P) set) is definable 
by a C- formula. Let (l>p{z) be the quantifier- free formula such that 4'p{z) is true 
if and only if P appears at position z. 

Then S=k{P) is definable by 



3Xi...3Xfc3^i,...,3Afc,Vx 



AiAi{x) ^ [Ai(North(x)) A^i(East(x))] 
^iXi{x) [Ai{x) A ^Ai{South{x)) A ^Ai{\Nest{ 

A,^jX,{x) =^ -Xj(.t) 
{ViXiix)) ^ <j)p{x) 

A 3zi, 3zk, Xi{zi) A • • • A Xkizk) 



The formula ensures indeed that Ai represents a quarter of the plane, Xi being a 
singleton representing the corner of that plane. If = this becomes Vx, ^(pp{x). 
To obtain a formula for S>k{P), change the last <^=^ to a => in the formula. 

□ 



4 Logic Characterization of SFT and Sofic Subshifts 

We start by a characterization of subshifts of finite type (SFTs, i.e tilings). The 
problem with SFTs is that they are closed neither by projection nor by union. 
As a consequence, the corresponding class of formulas is not very interesting: 

Theorem 4.1 A set of configurations is a SFT if and only if it is defined by a 
formula of the form 

where ip is quantifier-free. 

Note that there is only one quantifier in this formula. Formulas with more than 
one universal quantifier do not always correspond to SFT: This is due to SFTs 
not being closed by union. 

Proof. Let Pi . . . P„ be patterns. To each Pi we associate the quantifier-free 
formula (j)p- (z) which is true if and only if Pi appears at the position z. Then 
the subshifts that forbids patterns Pi . . . P„ is defined by the formula: 

V^,-.(/)pj(^) A---A-.(^P„(2) 

Conversely, let ip he a quantificr-frcc formula. Each term ti in ijj is of the 
form fi{z) where fi is some combination of the functions North, South, East and 
West, each fi thus representing somehow some vector Zi {fi{z) = z + Zi). Let 
Z be the collection of all vectors Zi that appear in the formula ijj. Now the fact 
that ^ is true at the position z only depends on the colors of the configurations 
in points {z + z\), . . . ,{z -\- Zn), i.e. on the pattern of domain Z that occurs at 
position z. Let P be the set of patterns of domain Z that makes ip false. Then 
the set S defined by ijj is the set of configurations where no patterns in P occurs, 
hence a SFT. □ 



Theorem 4.2 A set S is a sofic subshift if and only if it is definable by a formula 
of the form 



where ^ is quantifier-free. Moreover, any such formula is equivalent to a formula 
of the same form but with a single universal quantifier (p = 1). 

Note that the real difficulty in the proof of this theorem is to treat the only 
binary predicate, the equality (=). The reader might try to find a sofic subshift 
corresponding to the following formula before reading the proof: 



Proof. A sofic subshift being a projection of a SFT, one direction of the first 
assertion follows from the previous theorem and proposition 2.2. 
Let C be the class of formulas of the form: 



Now we prove by induction on the number p of universal quantifiers that 
each formula of C is equivalent to a formula with only one universal quantifier. 
There is nothing to prove for p = 1. 

First, we rewrite the formula in conjunctive normal form: 



where V'l is disjunctive. This is equivalent to 

3Xi, . . . ,3X„, AjVzi, . . . ,Vzp, i>i{Xi, . . . ,X„, zi...Zp) = 3Xi, . . . ,3X„, A^rjt 
Now we treat each rji separately. -0^ is a disjunction of four types of formulas: 



because terms are made only of bijective functions (compositions of North, South, 
East, West). We may suppose the last case never happens: yx,y,zf{x) ^ y V 
y, z) is equivalent to Vx, z, f{^)i We may rewrite 

'4!i{zi ...Zp) = e{zp) W Zp^ f{zki) W ■ --W Zp^ .f{zk^) V 0{zi . . .Zp-i) 

(we forgot the second-order variables to simplify notations) 

We may suppose that no formula is of the form Zp — Zp. Now is the key 
argument: Suppose that there are strictly more that m values of z such that 
e{z) is false. Then given zi . . . we may find a Zp such that the formula 
€{Zp) V {zp = f{zki)) V • • • V (zp = f{zk^)) is false. That is, if there are more than 
m values of z so that e(z) is false, then 



3Xi, 3X„, Vzi, . . . , Vzp, tp{Xi, Xn, zi... Zp) 




3Xi, . . .,3Xn,Wzi, . . .,'iZp, ^{Xi, . . .,Xn,Zi . . . Zp.) 



3Xi, . . . ,3X„,Vzi, . . . ,Vzp, Ai^pi{Xi 



>Pc{f{x)) — Pc(/(x)) •f{x)^y •f{x)^y 



Vzi, . . . ,Vz) 



1, 9{zi . ..Zp-i) 



must be true. 

As a consequence, our formula ry, is equivalent to the disjunction of the 
formula 

V^i, . . . ,Vzj,_i, 9{zi...Zp-i) 

and the formula 

r 

3Su...,3SmA Vz,VA{z) ^ -e(z) 

[ V^l, . . . ,V^p_i, 5i(/(^feJ) V • • • V SmifiZkJ) V ^(^1 . . . Zp_i) 

where express that Si has at most one element and is defined as follows: 

* ''^^'lAV-ri A{x) ^ A{North{x))AA{East{x)) 

' ~ ' \Si{x) <^ A{x) A ^A{South{x)) A ^A{\Nest{x)) 

Simplifying notations, our formula rji is equivalent to 

\/zi, . . . ,yzp-i, 6{zi . . . Zp-i) V 3Pi, . . . , BPg^/zi, . . .,\/zp-i, C,{zi . . . Zp-i) 

which is equivalent to 



3X,3Pi,...,3P,Vzi,...,Vzp_i, 




X(North(zi)) 
X(East(2i)) 

9{zi, . . . 

C{zi,...Zp-i) 



Now report this new formula instead of rn to obtain a formula 

3Xi, . . . , 3Xn, Ai3Ri, . . . , 3Rq.,yzi, . . . ,Vzp-i, 6i{zi . . . Zp^,Ri . . . Rq^) 
equivalent to 

3Xi, . . . , 3Xn, 3i?ii, . . . , 3Rkq^,\/zi, . . . ,\/Zp-i, Ai6i{zi . . . Zp^,Rii . . . Riqi) 

We finally obtain a formula of C with p — 1 universal quantifiers, and we may 
conclude by induction. 

To finish the proof, a formula with only one universal quantifier 

3Xu...,3Xnyz,e{z) 

defines indeed a sofic subshift (use the proof of theorem 4.1 to conclude that this 
formula defines a projection of a SFT, hence a sofic subshift) □ 

5 Separation Result 



Theorems 3.5 and 4.2 above suggest that EMSO-dcfinablc subshifts arc not 
necessarily sofic. We will show in this section that the set of EMSO-definable 
subshifts is indeed strictly larger than the set of sofic subshifts. The proof is 



based on the analysis of the computational complexity of forbidden languages. 
It is well-known that sofic subshifts have a recursively enumerable forbidden 
language. The following theorem shows that the forbidden language of an MSO- 
definable subshift can be arbitrarily high in the arithmetical hierarchy. 

This is not surprising since arbitrary Turing computation can be defined via 
first order formulas (using tilesets) and second order quantifiers can be used to 
simulate quantification of the arithmetical hierarchy. However, some care must 
be taken to ensure that the set of configurations obtained is a subshift. 

Theorem 5.1 Let E he an arithmetical set. Then there is an MSO-definable 
subshift with forbidden language T such that E reduces to T (for many-one 

reduction) . 

Proof (sketch). Suppose that the complement of is defined as the set of integers 
m such that: 

3xi,^X2, 3/Va;„, R{m, xi,...,Xn) 

where i? is a recursive relation. We first build a formula defining the set 
of configurations representing a successful computation of R on some input 
m,xi, . . . ,Xn. Consider 3 colors ci, c and Cr and additional second order vari- 
ables Xi, . . . , Xn and Si, . . . ,Sn- The input (m, Xi,. . . , a;„) to the computation 
is encoded in unary on an horizontal segment using colors c; and and variables 
Si as separators, precisely: first an occurrence of cj then m occurrences of c, then 
an occurrence of Cr and, for each successive 1 < i < n, Xi positions in Xi before 
a position of Si . Let be the FO formula expressing the following: 

1. there is exactly 1 occurrence of c; and the same for and all Si are single- 
tons; 

2. starting from an occurrence q and going east until reaching 5'„, the only 
possible successions of states are those forming a valid input as explained 
above. 

Now, the computation of R on any input encoded as above can be simulated via 
tiling constraints in the usual way. Consider sufficiently many new second order 
variables Yi, . . . ,Yp to handle the computation and let ^2 be the FO formula 
expressing that: 

1. a valid computation starts at the north of an occurrence of q; 

2. there is exactly one occurrence of the halting state (represented by some Yi) 
in the whole configuration. 

We define ^ by: 

3X1, VX2, . . . , 3/VX„, 35i, . . . , 3Sn, 3^1, . . . , 3Yp, cf>i A .^2. 

Finally let V' be the following FO formula: (V2;, -iPci) V (V^;, -^Pc^). Let X be the 
set defined hy ip. By construction, a finite (imidimensional) pattern of the 
form cic"^Cr appears in some configuration of X if and only \im ^ E. Therefore 
E is many-one reducible to the forbidden language of X. 



To conclude the proof it is sufficient to check that X is closed. To see this, 
consider a sequence (C„)„ of configurations of X converging to some configura- 
tion C. C has at most one occurrence of q and one occurrence of Cr- If one of 
these two states does not occur in C then C G X since if) is verified. If, conversely, 
both ci and Cp occur (once each) then any pattern containing both occurrences 
also occurs in some configuration C„ verifying 0. But </> is such that any modi- 
fication outside the segment between ci and in C„ does not change the fact 
that 4> if' satisfied provided no new c; and c^. colors are added. Therefore (j) is 
also satisfied by C and C €: X. □ 

The theorem gives the claimed separation result for subshifts of EMSO. 

Corollary 5.2 There are EMSO- definable subshifts which are not sofic. 

Proof. In the previous theorem, choose E, to be the complement of the set of 
integers m for which there is x such that machine m halts on empty input in 
less than x steps. E is not recursively enumerable and, using the construction of 
the proof above, it is reducible to the forbidden language of an EMSO-definable 
subshift. □ 



6 A Characterization of EMSO 

EMSO-definable sets are projections of FO-definable sets (proposition 2.2). Be- 
sides, sofic subshifts are projections of subshifts of finite type (or tilings). Pre- 
vious results show that the correspondence sofic^EMSO fails. However, we will 
show in this section how EMSO can be characterized through projections of 
"locally checkable" configurations. 

Corollary 3.4 expresses that FO-dcfinablc sets are essentially captured by 
counting occurrences of patterns up to some value. The key idea in the following 
is that this counting can be achieved by local checkings (equivalently, by tiling 
constraints) , provided it is limited to a finite and explicitly delimited region. This 
idea was successfully used in [7] in the context of picture languages: pictures are 
rectangular finite patterns with a border made explicit using a special state 
(which occiirs all along the border and nowhere else) . We will proceed here quite 
differently. Instead of putting special states on borders of some rectangular zone, 
we will simply require that two special subsets of states Qo and Q\ are present 
in the configuration: we call a {Qo,Qi)- marked configuration any configuration 
that contains both a color q G Qq and some color q' G Qi somewhere. By 
extension, given a subshift S over Q and two subsets Qo ^ Q and Qi C Q, the 
doubly-marked set Sq^^q-^ is the set of (Qo, Qi)-marked configurations of S. 
Finally, a doubly-marked set of finite type is a set ^Qq.Qi some SET S and 
some Qq,Qi. 

Lemma 6.1 For any finite pattern P and any k > 0, S-k{P) is the projection 
of some doubly-marked set of finite type. The same result holds for S>k{P). 



Moreover, any positive combination (union and intersection) of projections 
of doubly-marked sets of finite type is also the projection of some doubly-marked 
sets of finite type. 

Proof (sketch). We consider some base alphabet Q, some pattern P and some 
A: > 0. We will build a doubly-marked set of finite type over alphabet Q' — Q x Q+ 
and then project back on Q. Q+ is itself a product of different layers. The first 
layer can take values {0,1,2} and is devoted to the definition of the marker 
subsets Qo and Qi: a state is in Qi for i G {0, 1} if and only if its value on the 
layer is i. 

We first show how to convert the apparition in a configuration of two marked 
positions, by Qo and Qi, into a locally identifiable rectangular zone. The zone is 
defined by two opposite corners corresponding to an occurrence of some state of 
Qo and Qi respectively. This can be done using only finite type constraints as 
follows. By adding a new layer of states, one can ensure that there is a unique 
occurrence of a state of Qo and maintain everywhere the following information: 

1. Nq^ (z) = the position z is at the north of the (unique) occurrence of a state 



2. Eq^{z) = the position z is at the east of the occurrence of a state from Qq. 

The same can be done for Qi. From that, the membership to the rectangular 
zone is defined at any position z by the following predicate (see figure 6): 



Figures. The rectangular zone in dark gray defined by predicate Z[z). 

We can also define locally the border of the zone: precisely, cells not in the 
zone but adjacent to it. Now define P{z) to be true if and only if z is the lower- 
left position in an occurrence of the pattern P. We add k new layers, each one 
storing (among other things) a predicate Ci (z) verifying 



from Qo, 



Z{z) = Nq,{z) ^ Nq,{z) a Eq„{z) + Eq,{z). 




Qi 



Ci{z) =^ Z{z) A P{z) A /\ -Cj(z). 



Moreover, on each layer i, we enforce that exactly 1 position z verifies Ci{z): this 
can be done by maintaining north/south and east/west tags (as for Qq above) 
and requiring that the north (resp. south) border of the rectangular zone sees 
only the north (resp. south) tag and the same for east /west. Finally, we add the 
constraint: 

P{z)AZ{z)^\/Ci 

i 

expressing that each occurrence of P in the zone mut be "marked" by some Cj. 

Hence, the only admissible {Qa, Qi)-marked configurations are those whose rect- 
angular zone contains exactly k occurrences of pattern P. We thus obtain exactly 
S>k{P) after projection. To obtain 5=fc(P), it suffices to add the constraint: 

Piz) ^ Z{z) 

in order to forbid occurrences of P outside the rectangular zone. 

To conclude the proof we show that finite unions or intersections of projec- 
tions of doubly-marked sets of finite type are also projections of doubly-marked 
sets of finite type. Consider two SFT X over Q and Y over Q' and two pairs of 
marker subsets Qq, Qi C Q and Qq, Q[ C Q'. Let ni : Q ^ A and ^2 ■ Q' ^ A 
be two projections. 

First, for the case of union, we can suppose (up to renaming of states) that 
Q and Q' are disjoint and define the SFT S over alphabet QuQ' as follows: 

— 2 adjacent positions must be both in Q or both in Q'; 

— any pattern forbidden in X or y is forbidden in S. 

Clearly, Tr{SQ„uQ'„.QiUQO = MXqo,Qi) ^ M^Q'^^qO where 7r(g) is 7ri(g) when 
q & Q and 772(9) else. 

Now, for intersections, consider the SFT S over the fiber product 

Qx = {(q,q') e Q X Q'lTTiiq) = 7r2{q')} 

and defined as follows: a pattern is forbidden if its projection on the component 
Q (resp. Q') is forbidden in X (resp. Y); 

If we define tt as tti applied to the Q-component of states, and if E is the 
set of configuration of S such that states from Qo and Qi appear on the first 
component and states from Q'q and Q[ appear on the second one, then we have: 

TT{E)=7Tl{XQ„^Q,)Un2(YQ,^^Q'J. 

To conclude the proof, it is sufficient to obtain E as the projection of some 
doubly-marked set of finite type. This can be done starting from E and adding 
a new component of states whose behaviour is to define a zone from two markers 
(as in the first part of this proof) and check that the zone contains occurrences 
of Qo, Qi, Qq and Q[ in the appropriate components. □ 

Theorem 6.2 A set is EMSO-definable if and only if it is the projection of a 
doubly-marked set of finite type. 



Proof. First, a doubly- marked set of finite type is an FO-definable set because 
SFT are FO-definable (theorem 4.1) and the restriction to doubly-marked con- 
figurations can be expressed through a simple existential FO formula. Thus the 
projection of a doubly-marked set of finite type is EMSO-definable. 

The opposite direction follows immediately from proposition 2.2 and corol- 
lary 3.4 and the lemma above. □ 

At this point, one could wonder whether considering simply-marked set of 
finite type is sufficient to capture EMSO via projections. In fact the presence of 
2 markers is necessary in the above theorem: considering the set Sqo,Qi where 
E is the full shift and Qo and Qi arc distinct singleton subsets of Q, a 
simple compactness argument allows to show that it is not the projection of any 
simply-marked set of finite type. 

7 Open Problems 

— Is the second order alternation hierarchy strict for MSO (considering our 

model-theoretic equivalence)? 

— One can prove that theorem 4.1 also holds for formulas of the form: 

yxi...yxn,yz, i;{z,Xi...Xn) 

where ip is q\iantificr-frcc. Hence, adding universal second-order quantifiers 
does not increase the expression power of formulas of theorem 4.1. More 
generally, let C be the class of formulas of the form 

VXi, 3X2, . . . , V/3X„, Vzi, . . .,Wzp, 4>{Xi,. . . ,Xn,zi, . . . ,Zp). 

One can check that any formula in C defines a subshift. Is the second-order 
quantifiers alternation hierarchy strict in C? On the contrary, do all formulas 
in C represent sofic subshifts ? 
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